NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Masking the Gaps: An Imputation-Free Approach to Time Series Modeling with Missing Data

Neog, Abhilash; Daw, Arka; Khorasgani, Sepideh Fatemi; Karpatne, Anuj (December 2024, NeurIPS 2024 Workshop on Time Series in the Age of Large Models)

Full Text Available
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images

Mehrab, Kazi Sajeed; Maruf, M; Daw, Arka; Neog, Abhilash; Manogaran, Harish Babu; Khurana, Mridul; Feng, Zhenyang; Altintas, Bahadir; Bakis, Yasin; Campolongo, Elizabeth; et al (June 2025, CVPR)

The availability of large datasets of organism images combined with advances in artificial intelligence (AI) has significantly enhanced the study of organisms through images, unveiling biodiversity patterns and macro-evolutionary trends. However, existing machine learning (ML)-ready organism datasets have several limitations. First, these datasets often focus on species classification only, overlooking tasks involving visual traits of organisms. Second, they lack detailed visual trait annotations, like pixel-level segmentation, that are crucial for in-depth biological studies. Third, these datasets predominantly feature organisms in their natural habitats, posing challenges for aquatic species like fish, where underwater images often suffer from poor visual clarity, obscuring critical biological traits. This gap hampers the study of aquatic biodiversity patterns which is necessary for the assessment of climate change impacts, and evolutionary research on aquatic species morphology. To address this, we introduce the Fish-Visual Trait Analysis (Fish-Vista) dataset—a large, annotated collection of about 80K fish images spanning 3000 different species, supporting several challenging and biologically relevant tasks including species classification, trait identification, and trait segmentation. These images have been curated through a sophisticated data processing pipeline applied to a cumulative set of images obtained from various museum collections. Fish-Vista ensures that visual traits of images are clearly visible, and provides fine-grained labels of various visual traits present in each image. It also offers pixel-level annotations of 9 different traits for about 7000 fish images, facilitating additional trait segmentation and localization tasks. The ultimate goal of Fish-Vista is to provide a clean, carefully curated, high-resolution dataset that can serve as a foundation for accelerating biological discoveries using advances in AI. Finally, we provide a comprehensive analysis of state-of-the-art deep learning techniques on Fish-Vista.
more » « less
Free, publicly-accessible full text available June 15, 2026
MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments

Sawhney, Medha; Karmarkar, Bhas; Leaman, Eric J.; Daw, Arka; Karpatne, Anuj; Behkam, Bahareh (October 2023, arXivorg)

Tracking microrobots is challenging, considering their minute size and high speed. As the field progresses towards developing microrobots for biomedical applications and conducting mechanistic studies in physiologically relevant media (e.g., collagen), this challenge is exacerbated by the dense surrounding environments with feature size and shape comparable to microrobots. Herein, we report Motion Enhanced Multi-level Tracker (MEMTrack), a robust pipeline for detecting and tracking microrobots using synthetic motion features, deep learning-based object detection, and a modified Simple Online and Real-time Tracking (SORT) algorithm with interpolation for tracking. Our object detection approach combines different models based on the object's motion pattern. We trained and validated our model using bacterial micro-motors in collagen (tissue phantom) and tested it in collagen and aqueous media. We demonstrate that MEMTrack accurately tracks even the most challenging bacteria missed by skilled human annotators, achieving precision and recall of 77% and 48% in collagen and 94% and 35% in liquid media, respectively. Moreover, we show that MEMTrack can quantify average bacteria speed with no statistically significant difference from the laboriously-produced manual tracking data. MEMTrack represents a significant contribution to microrobot localization and tracking, and opens the potential for vision-based deep learning approaches to microrobot control in dense and low-contrast settings. All source code for training and testing MEMTrack and reproducing the results of the paper have been made publicly available this https URL.
more » « less
Full Text Available
Mitigating Propagation Failures in Physics-Informed Neural Networks Using Retain-Resample-Release (R3) Sampling

Daw, Arka; Bu, Jie; Wang, Sifan; Perdikaris, Paris; Karpatne, Anuj (July 2023, Proceedings of the 40th International Conference on Machine Learning)

Despite the success of physics-informed neural networks (PINNs) in approximating partial differential equations (PDEs), PINNs can sometimes fail to converge to the correct solution in problems involving complicated PDEs. This is reflected in several recent studies on characterizing the "failure modes" of PINNs, although a thorough understanding of the connection between PINN failure modes and sampling strategies is missing. In this paper, we provide a novel perspective of failure modes of PINNs by hypothesizing that training PINNs relies on successful "propagation" of solution from initial and/or boundary condition points to interior points. We show that PINNs with poor sampling strategies can get stuck at trivial solutions if there are propagation failures, characterized by highly imbalanced PDE residual fields. To mitigate propagation failures, we propose a novel Retain-Resample-Release sampling (R3) algorithm that can incrementally accumulate collocation points in regions of high PDE residuals with little to no computational overhead. We provide an extension of R3 sampling to respect the principle of causality while solving timedependent PDEs. We theoretically analyze the behavior of R3 sampling and empirically demonstrate its efficacy and efficiency in comparison with baselines on a variety of PDE problems.
more » « less
Full Text Available
Mitigating Propagation Failures in Physics-Informed Neural Networks Using Retain-Resample-Release (R3) Sampling

Daw, Arka and; Bu, Jie; Wang, Sifan; Perdikaris, Paris; Karpatne, Anuj (July 2023, Proceedings of Machine Learning Research)

Despite the success of physics-informed neural networks (PINNs) in approximating partial differential equations (PDEs), PINNs can sometimes fail to converge to the correct solution in problems involving complicated PDEs. This is reflected in several recent studies on characterizing the "failure modes" of PINNs, although a thorough understanding of the connection between PINN failure modes and sampling strategies is missing. In this paper, we provide a novel perspective of failure modes of PINNs by hypothesizing that training PINNs relies on successful "propagation" of solution from initial and/or boundary condition points to interior points. We show that PINNs with poor sampling strategies can get stuck at trivial solutions if there are propagation failures, characterized by highly imbalanced PDE residual fields. To mitigate propagation failures, we propose a novel Retain-Resample-Release sampling (R3) algorithm that can incrementally accumulate collocation points in regions of high PDE residuals with little to no computational overhead. We provide an extension of R3 sampling to respect the principle of causality while solving timedependent PDEs. We theoretically analyze the behavior of R3 sampling and empirically demonstrate its efficacy and efficiency in comparison with baselines on a variety of PDE problems.
more » « less
Detecting and Tracking Hard-to-Detect Bacteria in Dense Porous Backgrounds

Sawhney, Medha; Karmarkar, Bhas; Leaman, Eric J.; Daw, Arka; Karpatne, Anuj; Behkam, Bahareh (June 2023, CVPR Workshop on CV4Animals 2023)

Studying bacteria motility is crucial to understanding and controlling biomedical and ecological phenomena involving bacteria. Tracking bacteria in complex environments such as polysaccharides (agar) or protein (collagen) hydrogels is a challenging task due to the lack of visually distinguishable features between bacteria and surrounding environment, making state-of-the-art methods for tracking easily recognizable objects such as pedestrians and cars unsuitable for this application. We propose a novel pipeline for detecting and tracking bacteria in bright-field microscopy videos involving bacteria in complex backgrounds. Our pipeline uses motion-based features and combines multiple models for detecting bacteria of varying difficulty levels. We apply multiple filters to prune false positive detections, and then use the SORT tracking algorithm with interpolation in case of missing detections. Our results demonstrate that our pipeline can accurately track hard-to-detect bacteria, achieving a high precision and recall.
more » « less
Full Text Available
PID-GAN: A GAN Framework based on a Physics-informed Discriminator for Uncertainty Quantification with Physics

https://doi.org/10.1145/3447548.3467449

Daw, Arka; Maruf, M.; Karpatne, Anuj (August 2021, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD))

Full Text Available
Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Bu, Jie; Daw, Arka; Maruf, M.; Karpatne, Anuj (January 2021, Advances in neural information processing systems)

Full Text Available
Motion Enhanced Multi‐Level Tracker (MEMTrack): A Deep Learning‐Based Approach to Microrobot Tracking in Dense and Low‐Contrast Environments

https://doi.org/10.1002/aisy.202300590

Sawhney, Medha; Karmarkar, Bhas; Leaman, Eric_J; Daw, Arka; Karpatne, Anuj; Behkam, Bahareh (February 2024, Advanced Intelligent Systems)

Tracking microrobots is challenging due to their minute size and high speed. In biomedical applications, this challenge is exacerbated by the dense surrounding environments with feature sizes and shapes comparable to microrobots. Herein, Motion Enhanced Multi‐level Tracker (MEMTrack) is introduced for detecting and tracking microrobots in dense and low‐contrast environments. Informed by the physics of microrobot motion, synthetic motion features for deep learning‐based object detection and a modified Simple Online and Real‐time Tracking (SORT)algorithm with interpolation are used for tracking. MEMTrack is trained and tested using bacterial micromotors in collagen (tissue phantom), achieving precision and recall of 76% and 51%, respectively. Compared to the state‐of‐the‐art baseline models, MEMTrack provides a minimum of 2.6‐fold higher precision with a reasonably high recall. MEMTrack's generalizability to unseen (aqueous) media and its versatility in tracking microrobots of different shapes, sizes, and motion characteristics are shown. Finally, it is shown that MEMTrack localizes objects with a root‐mean‐square error of less than 1.84 μm and quantifies the average speed of all tested systems with no statistically significant difference from the laboriously produced manual tracking data. MEMTrack significantly advances microrobot localization and tracking in dense and low‐contrast settings and can impact fundamental and translational microrobotic research.
more » « less
Physics-Guided Architecture (PGA) of Neural Networks for Quantifying Uncertainty in Lake Temperature Modeling

https://doi.org/10.1137/1.9781611976236.60

Daw, Arka Thomas (January 2020, Proceedings of the SIAM International Conference on Data Mining)
null (Ed.)
Full Text Available

Search for: All records